Fig. 4A,



10 30 50

• • • • • • 

GAATTCCATGTTCTTCTCCTTGCATGTGGCCC 
CTTAAGGTACAAGAAGAGGAACGTACACCGGGCG 

70 90 110 

CGCCCGCAGAGGGCGCATGTCCGGGTGCCTGGATATGGCGCGTACGGCGTGCCCTCCGTC 
GCGGGCGTCTCCCGCGTACAGGCCCACGGACCTATACCGCGCATGCCGCACGGGAGGCAG

130 150 170 

GTTAACCCCGAGGTTGGCCACGATGCCCCGGCCATCAGGTCTGGAA
CAATTGGGGCTCCAACCGGTGCTACGGGGCCGGTAGTCCAGACCTTACGATCGCAAGG 

190 210 230 

ACGAAGGTAACCCACAGTGACTCACACCACAAGTACTAGAA 
TGCTTCCATTGGGTGTCACTGAGTGTGGTGTTCATGATCTTACGTO 

250 270 290 

* * • • • • 
CGCCGCGGGATAAGGGGGAGCCATGTCCGGGACGCCGACGGAAAGCCTGACTCGATGACC 
GCGGCGCCGTATTCCCCCTCGGTACAGGCCCTGCGGCTGCCTTTCGGACTGAGCTACTTC 

M T 

310 330 350 

ACCACCGACACCGGCCCCAAGCCGGGCAGTGAGGCCGCCGCCCTGCTCGCCAATGTCCGC
TGGTGGCTGTGGCCGGGGTTCGGCCCGTCACTCCGGCGGCGGGACGAGCGGTTACAGGCG 
T TDT GPKPGSEAAALLANVR 

370 390 410 

ACCTCGGGGGCGCGGCTGTCCTCCGCGTTGTACGAC^ 

TGGAGCCCCCGCGCCGACAGGAGGCGCAACATGCTGTAAGACTTCTTG 

TSGARLSSALYDILKNRLLE 

430 450 470 

GGGCGCTATGCGGCAGGCGAGAAGATCGTCGTCGAGTCGATCCG 

CCCGCGATACGCCGTCCGCTCTTCTAGCAGCAGCTCAGCTAGGCCGTTCTC 

GRYAAGEKIVVESIRQEFGV 

490 510 530 

AGCAAGCAGCCCGTCATGGACGCTCTGCGCCGCCTGTC 

TCGTTCGTCGGGCAGTACCTGCGAGACGCGGCGGACAGGTCGCTGTTC 

SKQPVMDALRRLSSDKLVHI 

550 570 590 

GTTCCCCAGGTCGGTTGCGAGGTCGTCTCCTACG

CAAGGGGTCCAGCCAACGCTCCAGCAGAGGATGCGGGGCGCGCTTGAC 

VPQVGCEVVSYAPREVEDFY 

610 630 650 



ACCCTGTTCGGCGGTTTCGAAGGGACCATCGCCGCGGTAGCGGCCTCCCGGCGGACCGAG



TGGGACAAGCCGCCAAAGCTTCCCTGGTAGCGGCGCCATCGCCGGAGGGCCGCCTGGCTC 
TLFGGFEGTIAAVAASRRTE 

670 690 710 

GCCCAGTTGCTGGAGCTGGACCTGATCTCGGCGCGGGTCGACGCCCTGATCACCTCCCAC 
CGGGTCAACGACCTCGACCTGGACTAGAGCCGCGCCCAGCTGCGGGACTAGTGGAGGGTG
AQLLELDLISARVDALITSH 

730 750 770 

GACCCGGTGGTCCGCGCCCGCGGGTACCGCGTGCACAACCGGGAGTTCCATGCGGCCATC 
CTGGGCCACCAGGCGCGGGCGCCCATGGCGCACGTGTTGGCCCTCAAGGTACGCCGGTAG 
DPVVRARGYRVHNREFHAAI 

790 810 830 

CACGCGATGGCGCACTCGCGGATCATGGAGGAGACCAGCCAGCGAATGTGGGATCTGTCG 
GTGCGCTACCGCGTGAGCGCCTAGTACCTCCTCTGGTCGGTCGCTTACACCCTAGACAGC 
H AMAHSR I MEET SQRMWDL S 

850 870 890 

GACTTCTTGATCAACACCACCGGCATCACCAACCCGCTCTCGAGCGCACTCCCCGACCGG 
CTGAAGAACTAGTTGTGGTGGCCGTAGTGGTTGGGCGAGAGCTCGCGTGACGGGCTGGCC 
DFLINTTGITNPLSSAL PDR 

910 930 950 

CAGCATGACCACCACGAAATCACCGAGGCCATCCGCAACCGTGACGCAGCTGCCGCCCGC 
GTCGTACTGGTGGTGCTTTAGTGGCTCCGGTAGGCGTTGGCACTGCGTCGACGGCGGGCG 
QHDHHEITE AIRNRDAAAAR 

970 990 1010 

GAGGCCATGGAACGCCACATCGTCGGCACCATCGCAGTAATCCGCGACGAATCCAACGCC 
CTCCGGTACCTTGCGGTGTAGCAGCCGTGGTAGCGTCATTAGGCGCTGCTTAGGTTGCGG 
EAMERHIVGT IAVIRDESNA

1030 1050 1070 

CAGCTGCCGAGCTAGACCCCGATACCCGGGCCATCGACCGGCTCCGCTATCGCGCCACCT 
GTCGACGGCTCGATCTGGGGCTATGGGCCCGGTAGCTGGCCGAGGCGATAGCGCGGTGGA 

Q L P S * 

1090 1110 1130 

ACGCCGAGGGGGGACTCTCGGCCGTAGCGCTGCAGACGATCCACCGGCACCCTCCACGCT 
TGCGGCTCCCCCCTGAGAGCCGGCATCGCGACGTCTGCTAGGTGGCCGTGGGAGGTGCGA 

1150 1170 1190 

GACCCCTGTCTCGCCCTAGAGGGCCGGCGCGCCGTCGATCACCTTTACCCTCATCCAGAG 
CTGGGGACAGAGCGGGATCTCCCGGCCGCGCGGCAGCTAGTGGAAATGGGAGTAGGTCTC 

1210 1230 1250 

ACTTGCGTCACCCTCTATGCCCGAGTAGCGTCTGAACTAGACGTCTAGCATTCTAGTTGA 
TGAACGCAGTGGGAGATACGGGCTCATCGCAGACTTGATCTGCAGATCGTAAGATCAACT

1270 1290 1310 

GTGCTCCCTCTCGAAGATTCTCCAGAGAACCCCTCTCGAACATCCCCAGAAGAAAGGAGC 





CACGAGGGAGAGCTTCTAAGAGGTCTCTTGGGGAGAGCTTGTAGGGGTCTTCTTTCCTCG
1330 1350 1370 

CATGACGACCGCTTCGCACGCATCGTCCTTCGGGGCACGAGCCCACTTCCGCCCACA 
bcGGTACTGCTGGCGAAGCGTGCGTAGCAGGAAGCCCCGTGCTCGGGTGAAGGCGGGTGT 






1390 1410 1430



GATCGGGGAAGCCCGACCGTGAGCACCACACCTACCTCCCCGACGAAGACCTCACCGCTG 
CTAGCCCCTTCGGGCTGGCACTCGTGGTGTGGATGGAGGGGCTGCTTCTGGAGTGGCGAC 



1450 1470 1490



CGGGTAGCGATGGCCAGCTTCATCGGTACCACCGTCGAGTACTACGACTTCTTCATCTAC
GCCCATCGCTACCGGTCGAAGTAGCCATGGTGGCAGCTCATGATGCTGAAGAAGTAGATG 
MASFIGTTVEYYDFFIY 



1510 1530 1550



GGCACCGCGGCCGCGCTGGTATTCCCTGAGTTGTTCTTCCCGGATGTCTCGTCCGCGATC
CCGTGGCGCCGGCGCGACCATAAGGGACTCAACAAGAAGGGCCTACAGAGCAGGCGCTAG 
GTAAALVFPELFFPDVSSAI 



1570 1590 1610



GGAATCCTGTTGTCGTTCGCGACCTTCAGCGTTGGGTTCCTCGCCCGCCCGCTGGGTGGC

CCTTAGGACAACAGCAAGCGCTGGAAGTCGCAACCCAAGGAGCGGGCGGGCGACCCACCG 
GILLSFATFSVGFLARPLGG 



1630 1650 1670



ATAGTGTTCGGGCACTTCGGTGACCGGGTCGGCCGCAAGCAGATGCTCGTGATCTCCCTG 
TATCACAAGCCCGTGAAGCCACTGGCGCAGCCGGCGTTCGTCTACGACCACTAGAGGGAC 
IVFGHFGDRVGRKQMLVISL 



1690 1710 1730



GTCGGAATGGGCTCGGCCACCGTACTGATGGGATTGTTGCCCGGTTACGCCCAAATCGGG 
CAGCCTTACCCGAGCCGGTGGCATGACTACCCTAACAACGGGCCAATGCGGGTTTAGCCC 
VGMGSATVLMGLLPGYAQIG



1750 1770 1790



ATCGCCGCCCCCATCCTGCTGACCCTGCTGCGCCTGGTGCAGGGCTTTGCCGTCGGCGGC

TAGCGGCGGGGGTAGGACGACTGGGACGACGCGGACCACGTCCCGAAACGGCAGCCGCCG 
I AAP I LLTLLRLVQGFAVGG 



1810 1830 1850



GAGTGGGGTGGAGCCACCCTGATGGCCGTCGAGCACGCCCCCACCGCGAAGAAGGGCTTT 
CTCACCCCACCTCGGTGGGACTACCGGCAGCTCGTGCGGGGGTGGCGCTTCTTCCCGAAA 
EWGGATLMAVEHA PTAKK GF 



1870 1890 1910



TTCGGATCCTTCTCCCAGATGGGGGCACCCGCCGGGACCAGCGTCGCAACCCTGGCGTTC 
AAGCCTAGGAAGAGGGTCTACCCCCGTGGGCGGCCCTGGTCGCAGCGTTGGGACCGCAAG 
FGSFSQMGAPAGTSVATLAF 



1930 1950 1970



TTCGCGGTCTCCCAATTGCCCGACGAGCAGTTCCTGAGTTGGGGCTGGCGACTGCCGTTC






AAGCGCCAGAGGGTTAACGGGCTGCTCGTCAAGGACTCAACCCCGACCGCTGACGGCAAG 
FAVSQLPDEQFLSWGWRLPF 

1990 2010 2030 

CTGTTCAGCGCGGTGCTCATCGTGATCGGGCTGTTCATTCGCCTC 

GACAAGTCGCGCCACGACTAGCACTAGCCCGACAAGTAAGCGGACAGGGACCGGCTTTCG 
LFSAVLIVIGLFIRLSLAES 

2050 2070 2090 

CCCGACTTCGCCGAGGTGAAGGCACAGAGCGCCGTGGTGCGAATGCCGATCGCCGAAGCG 
GGGCTGAAGCGGCTCCACTTCCGTGTCTCGCGGCACCACGCTTACGGCTAGCGGCTTCGC 
PDFAEVKAQSAVVRM PIAEA

2110 2130 2150 

TTCCGCAAGCACTGGAAGGAAATTCTCCTCATCGCGGGCACCTACCTGTCCCAAGGAGTG 
AAGGCGTTCGTGACCTTCCTTTAAGAGGAGTAGCGCCCGTGGATGGACAGGGTTCCTCAC 
FRKHWKEILLIAGTYLSQGV 

2170 2190 2210 

TTCGCCTATATCTGCATGGCCTACCTCGTCTCCTACGGCACCACCGTCGCGGGGATCAGC 
AAGCGGATATAGACGTACCGGATGGAGCAGAGGATGCCGTGGTGGCAGCGCCCCTAGTCG 
FAY ICMAYLV SYGTTVAG I S 

2230 2250 2270 

CGCACCTTCGCCCTGGCCGGAGTATTCGTCGCCGGCATCGTCGCCGTCCTCCTCTACCTC 
GCGTGGAAGCGGGACCGGCCTCATAAGCAGCGGCCGTAGCAGCGGCAGGAGGAGATGGAG 
RTFALAGVFVAGIVAVLLYL 

2290 2310 2330 

GTGTTCGGCGCTCTGTCCGACACTTTCGGCCGCAAGACCATGTACCTGCTCGGCGCCGCC 
CACAAGCCGCGAGACAGGCTGTGAAAGCCGGCGTTCTGGTACATGGACGAGCCGCGGCGG 
VFGALSDTFGRKTMYLLGAA 

2350 2370 2390 

GCGATGGGTGTGGTGATCGCCCCCGCCTTCGCACTGATCAACACCGGCAACCCGTGGCTG

CGCTACCCACACCACTAGCGGGGGCGGAAGCGTGACTAGTTGTGGCCGTTGGGCACCGAC 
AMGVVIAPAFALINTGNPWL 

2410 2430 2450 

TTCATGGCCGCGCAGGTGCTGGTCTTCGGAATTGCAATGGCCCCCGCCGCCGGCGTGACA 
AAGTACCGGCGCGTCCACGACCAGAAGCCTTAACGTTACCGGGGGCGGCGGCCGCACTGT 
FMAAQVLVFGIAMAPAAGVT 

2470 2490 2510 

GGCTCCCTGTTCACGATGGTCTTCGACGCGGACGTGCGCTACAGCGGTGTCTCTATCGGC
CCGAGGGACAAGTGCTACCAGAAGCTGCGCCTGCACGCGATGTCGCCACAGAGATAGCCG 
GSLFTMVFDADVRYSGVSIG 

2530 2550 2570 

TACACCATCTCCCAGGTCGCCGGCTCCGCGTTCGCCCCGACGATCGCGACCGCCTTGTAC 
ATGTGGTAGAGGGTCCAGCGGCCGAGGCGCAAGCGGGGCTGCTAGCGCTGGCGGAACATG 
YTI SQ VAG S AFAP TIATALY 



2590 2610 2630 

GCCTCCACCAACACCAGCAACTCGATCGTGACCTACCTGCTGATCGTCTCGGCCATCTCG 
CGGAGGTGGTTGTGGTCGTTGAGCTAGCACTGGATGGACGACTAGCAGAGCCGGTAGAGC 
ASTNTSNSIVTYLLIVSAIS 

2650 2670 2690 

ATCGTCTCGGTGATCCTGCTGCCCGGCGGCTGGGGGCGCAAGGGCGCTGCGAGCCAGCTC 
TAGCAGAGCCACTAGGACGACGGGCCGCCGACCCCCGCGTTCCCGCGACGCTCGGTCGAG 
I V S V I LLPGGWGRKGAASQL 

2710 2730 2750 

• • • • * * 

ACTCGCGACCAGGCCACCTCCACACCGAAAATGCCTGACACCGAAACATTTTCGACTCGG 
TGAGCGCTGGTCCGGTGGAGGTGTGGCTTTTACGGACTGTGGCTTTGTAAAAGCTGAGCC
TRDQATSTPKMPDTETFSTR 

2770 2790 2810 

ACAGTTCCGGACACCGCAGCATCCCTGCGCGTCCTCGACAAGTGAAGTGATGACAGACAT 
TGTCAAGGCCTGTGGCGTCGTAGGGACGCGCAGGAGCTGTTCACTTCACTACTGTCTGTA
TVPDTAAS LRVLDK* MTDM 

2830 2850 2870 

. . • * • * 

GAGTGACCACGACCGCACCTCCTACGACACCGACGTCGTGATCGTCGGCCTCGGCCCCGC 
CTCACTGGTGCTGGCGTGGAGGATGCTGTGGCTGCAGCACTAGCAGCCGGAGCCGGGGCG 
SDHDRTSYDTDVVIVGLGPA 

2890 2910 2930 

CGGTGGCACAGCGGCGCTTGCCCTGGCCAGCTACGGCATCCGCGTTCACGCCGTCTCGAT 
GCCACCGTGTCGCCGCGAACGGGACCGGTCGATGCCGTAGGCGCAAGTGCGGCAGAGCTA 
GGTAALALASYG IRVHAVSM 

2950 2970 2990 

GTTCCCCTGGGTGGCGAACTCGCCGCGCGCGCACATCACCAACCAGCGCGCCGTCGAAGT 
CAAGGGGACCCACCGCTTGAGCGGCGCGCGCGTGTAGTGGTTGGTCGCGCGGCAGCTTCA 
F PWVANSPRAHI TNQRAVEV 

3010 3030 3050 

GCTGCGTGACCTGGGCGTCGAAGACGAGGCGCGCAACTACGCCACCCCGTGGGACCAGAT 
CGACGCACTGGACCCGCAGCTTCTGCTCCGCGCGTTGATGCGGTGGGGCACCCTGGTCTA 
LRDLGVEDEARNYATPWDQM 

3070 3090 3110 

• • • • * * 

GGGCGACACGCTGTTCACCACGAGCCTGGCCGGCGAGGAGATCGTCCGGATGCAGACCTG 
CCCGCTGTGCGACAAGTGGTGCTCGGACCGGCCGCTCCTCTAGCAGGCCTACGTCTGGAC 
GDTLFTTSLAGEEIVRMQTW 

3130 3150 3170 

GGGTACGGGCGATATCCGCTACGGGGACTACCTGTCCGGAAGCCCCTGCACGATGCTCGA 
CCCATGCCCGCTATAGGCGATGCCCCTGATGGACAGGCCTTCGGGGACGTGCTACGAGCT 
GTGDIRYGDYLSGS PCTMLD 

3190 3210 3230 

CATTCCGCAGCCCCTGATGGAGCCGGTGCTGATCAAGAACGCCGCCGAACGTGGTGCGGT 



GTAAGGCGTCGGGGACTACCTCGGCCACGACTAGTTCTTGCGGCGGCTTGCACCACGCCA 
I PQPLMEPVL IKNAAERGAV 

3250 3270 3290

CATCAGCTTCAACACCGAATACCTCGACCACGCCCAGGACGAGGACGGGGTGACCGTCCG
GTAGTCGAAGTTGTGGCTTATGGAGCTGGTGCGGGTCCTGCTCCTGCCCCACTGGCAGGC 
I SFNTEYLDHAQDEDGVTVR

3310 3330 3350 

GTTCCGCGACGTCCGCTCGGGCACCGTGTTCACCCAGCGAGCCCGCTTCCTGCTCGGTTT
CAAGGCGCTGCAGGCGAGCCCGTGGCACAAGTGGGTCGCTCGGGCGAAGGACGAGCCAAA 
FRDVRSGTVFTQRARFLLGF 

3370 3390 3410 

CGACGGCGCACGATCGAAGATCGCCGAACAGATCGGGCTTCCGTTCGAAGGTGAACTCGC
GCTGCCGCGTGCTAGCTTCTAGCGGCTTGTCTAGCCCGAAGGCAAGCTTCCACTTGAGCG 
DGARSKIAEQIGLPFEGELA 

3430 3450 3470 

CCGCGCCGGTACCGCGTACATCCTGTTCAACGCGGACCTGAGCAAATATGTCGCTCATCG 
GGCGCGGCCATGGCGCATGTAGGACAAGTTGCGCCTGGACTCGTTTATACAGCGAGTAGC 
RAGTAYI LFNADLSKYVAHR 

3490 3510 3530 

GCCGAGCATCTTGCACTGGATCGTCAACTCGAAGGCCGGTTTCGGTGAGATCGGCATGGG 
CGGCTCGTAGAACGTGACCTAGCAGTTGAGCTTCCGGCGAAAGCCACTCTAGCCGTACCC 
PS I LHWIVNSKAGFGEIGMG 

3550 3570 3590 

TCTGCTGCGCGCGATCCGACCGTGGGACCAGTGGATCGCCGGCTGGGGCTTCGACATGGC 
AGACGACGCGCGCTAGGCTGGCACCCTGGTCACCTAGCGGCCGACCCCGAAGCTGTACCG 
LLRAI RPWDQWI AGW GFDMA 

3610 3630 3650 

GAACGGCGAGCCGGATGTCTCCGACGACGTTGTCCTCGAACAGATCCGGACCCTCGTCGG
CTTGCCGCTCGGCCTACAGAGGCTGCTGCAACAGGAGCTTGTCTAGGCCTGGGAGCAGCC 
NGE PDVSDD VVLEQI RTLVG 

3670 3690 3710 

CGACCCGCACCTGGACGTCGAGATCGTGTCGAGGTCCTTCTGGTACGTCAACCGGCAGTG 
GCTGGGCGTGGACCTGCAGCTCTAGCACAGCTCCAGGAAGACCATGCAGTTGGCCGTCAC 
DPHLDVEIVSRS FWYVMRQW 

3730 3750 3770 

GGCTGAGCACTACCAGTCCGGTCGAGTGTTCTGCGGCGGCGACGCGGTGCACCGGCATCC 
CCGACTCGTGATGGTCAGGCCAGCTCACAAGACGCCGCCGCTGCGCCACGTGGCCGTAGG 
AEHYQSGRVFCGG DAVHRHP 

3790 3810 3830 

GCCGAGCAGCGGGCTGGGCTCGAACACGTCCATGCAGGACGCGTTCAACCTGGCATGGAA 
CGGCTCGTCGCCCGACCCGAGCTTGTGCAGGTACGTCCTGCGCAAGTTGGACCGTACCTT
PSSGLGSNT S M QDAFNLAWK 




3850 3870 3890 

. • • • • * 

GATCGCGTTCGTCGTGAAGGGGTATGCAGGACCGGGTCTGCTCGAGTCCTACTCTCCTGA 
CTAGCGCAAGCAGCACTTCCCCATACGTCCTGGCCCAGACGAGCTCAGGATGAGAGGACT
IAFVVKGYAGPGLLESYSPE 

3910 3930 3950 

GCGTGTTCCGGTCGGCAAACAGATCGTCGCTCGCGCCAACCAGTCCCGCAAGGACTACGC
CGCACAAGGCCAGCCGTTTGTCTAGCAGCGAGCGCGGTTGGTCAGGGCGTTCCTGATGCG 
RVPVGKQIVARANQSRKDYA 

3970 3990 4010 

CGGGCTGCGCGAATGGTTCGATCACGAGAGCGACGACCCGGTCGCCGCCGGCCTGGCAAA 
GCCCGACGCGCTTACCAAGCTAGTGCTCTCGCTGCTGGGCCAGCGGCGGCCGGACCGTTT 
GLREWFDHESDD PVAAGLAK 

4030 4050 4070 

• 

GTTGAAGGAACCCTCGTCCGAAGGTGTTGCTCTGCGTGAGCGGCTGTACGAGGCGCTGGA 
CAACTTCCTTGGGAGCAGGCTTCCACAACGAGACGCACTCGCCGACATGCTCCGCGACCT 
LKEPSSEGVALRERLYEALE 

4090 4110 4130 

s • » • * • 

GGTGAAGAACGCCGAATTCAACGCCCAGGGCGTCGAACTCAACCAGCGCTACACCTCGTC 
CCACTTCTTGCGGCTTAAGTTGCGGGTCCCGCAGCTTGAGTTGGTCGCGATGTGGAGCAG 
VKNAEFNAQGVELNQRYTSS 

4150 4170 4190 

• • * * * * 

CGCGGTCGTTCCCGACCCCGAGGCGGGCGAGGAAGTGTGGGTGCGCGATCGTGAGCTGTA 
GCGCCAGCAAGGGCTGGGGCTCCGCCCGCTCCTTCACACCCACGCGCTAGCACTCGACAT 
A V V P D P EAGE EVWVRD R ELY 

4210 4230 4250 

CCTGCAGGCCACCACCCGGCCGGGCGCGAAGCTGCCGCATGCGTGGCTGGTCGGCGCCGA 
GGACGTCCGGTGGTGGGCCGGCCCGCGCTTCGACGGCGTACGCACCGACCAGCCGCGGCT 
LQAT TRPGAKLPHAWLVGAD 

4270 4290 4310 

CGGAACCCGCATCTCCACCCTCGACGTCACCGGCAAGGGAATGATGACCCTGCTGACCGG 
GCCTTGGGCGTAGAGGTGGGAGCTGCAGTGGCCGTTCCCTTACTACTGGGACGACTGGCC 
GTRI STLDVTGKGMMTLLTG 

4330 4350 4370 

ACTCGGCGGCCAGGCATGGAAGCGTGCCGCCGCCAAACTCGACCTGCCGTTCCTGCGGAC 
TGAGCCGCCGGTCCGTACCTTCGCACGGCGGCGGTTTGAGCTGGACGGCAAGGACGCCTG 
LGGQAWKRAAAKLDLPFLRT 

4390 4410 4430 

CGTCGTTGTCGGCGAACCCGGCACCATCGACCCTTACGGATACTGGCGGCGGGTCCGCGA 
GCAGCAACAGCCGCTTGGGCCGTGGTAGCTGGGAATGCCTATGACCGCCGCCCAGGCGCT 
VVVGE PGTIDPYGYWRRVRD

4450 4470 4490 

CATCGACGAGGCCGGCGCCCTGCTCGTGCGGCCCGACGGCTACGTCGCGTGGCGACACAG 




GTAGCTGCTCCGGCCGCGGGACGAGCACGCCGGGCTGCCGATGCAGCGCACCGCTGTGTC
IDEAGALLVRPDGYVAWRHS

4510 4530 4550 

TGCTCCGGTCTGGGACGACACCGAAGCGCTCACCAGCCTCGAGAACGCTCTCACCGCGGT 
ACGAGGCCAGACCCTGCTGTGGCTTCGCGAGTGGTCGGAGCTCTTGCGAGAGTGGCGCCA 
APVWDDTEALTS LENALTAV 

4570 4590 4610 

CCTCGACCACTCGGCCAGCGACAACGGGAACCCGAGCGGCACAAACGAGCCGCAGTACAG 

GGAGCTGGTGAGCCGGTCGCTGTTGCCCTTGGGCTCGCCGTGTTTGCTCGGCGTCATGTC
LDHSASDNGNPSGTNE PQ YS 

4630 4650 4670 

CACCCGGGCCGTGCCGATCGTCGTTCCGCACGTTACCGCCGAGGATGCAGCACCAGCTTC 
GTGGGCCCGGCACGGCTAGCAGCAAGGCGTGCAATGGCGGCTCCTACGTCGTGGTCGAAG 
TRAVP IVVPHVTAEDAAPAS 

4690 4710 4730 

CGCCACCCGCACCACCACAGTCGAGGGAGAGAACCGATGACCCGTCCTTACACCAGCGTC 
GCGGTGGGCGTGGTGGTGTCAGCTCCCTCTCTTGGCTACTGGGCAGGAATGTGGTCGCAG
ATRTTTVEGENR* 

MTRPYTSV 

4750 4770 4790 

TGGGACGACCTGAACCAGGTCGAGTTCAGCCAGGGATTCATCCAGGCCGGCCCCTACCGG 
ACCCTGCTGGACTTGGTCCAGCTCAAGTCGGTCCCTAAGTAGGTCCGGCCGGGGATGGCC 
WDDLNQVEFSQGF IQAGPYR 

4810 4830 4850 

ACCCGATACCTGCACGCCGGCGATTCGTCCAAGCCCACGCTGATCCTGCTGCACGGCATC 
TGGGCTATGGACGTGCGGCCGCTAAGCAGGTTCGGGTGCGACTAGGACGACGTGCCGTAG 
TRYLHAGDSSKPTLILLHGI 

4870 4890 4910 

ACCGGCCACTCCGAGGCGTACGTGCGCAATCTGCGCTCGCATTCCGAGCACTTCAACGTC 
TGGCCGGTGCGGCTCCGCATGCACGCGTTAGACGCGAGCGTAAGGCTCGTGAAGTTGCAG 
TGHAEAYVRNLRSHSEHFNV 

4930 4950 4970 

TGGGCAATCGACTTCATCGGCCACGGCTATTCGACCAAGCCCGACCACCCGCTCGAGATC
ACCCGTTAGCTGAAGTAGCCGGTGCCGATAAGCTGGTTCGGGCTGGTGGGCGAGCTCTAG 
WAIDFIGHGYSTKPDHPLEI 

4990 5010 5030 

AAGCACTACATCGACCACGTGCTGCAGTTGCTGGACGCCATCGGCGTCGAGAAGGCCTCG 
TTCGTGATGTAGCTGGTGCACGACGTCAACGACCTGCGGTAGCCGCAGCTCTTCCGGAGC 
KHYIDHVLQLLDAIGVEKAS 

5050 5070 5090 

TTTTCCGGGGAGTCTCTCGGCGGTTGGGTCACCGCCCAGTTCGCGCACGACCATCCCGAG

AAAAGGCCCCTCAGAGAGCCGCCAACCCAGTGGCGGGTCAAGCGCGTGCTGGTAGGGCTC 
FSGESL GGWVTAQ FAHD HPE




5110 5130 5150 

AAGGTCGACCGGATCGTGCTCAACACCATGGGCGGCACCATGGCCAACCCTCAGGTGATG 
TTCCAGCTGGCCTAGCACGAGTTGTGGTACCCGCCGTGGTACCGGTTGGGAGTCCACTAC 
KVDR I VLNTMGGTMANPQVM 

5170 5190 5210 

GAACGTCTCTATACCCTGTCGATGGAAGCGGCGAAGGACCCGAGCTGGGAACGCGTCAAA 
CTTGCAGAGATATGGGACAGCTACCTTCGCCGCTTCCTGGGCTCGACCCTTGCGCAGTTT 
ERLYTLSMEAAKDPSWERVK 

5230 5250 5270 

• • • • * • 

GCACGCCTCGAATGGCTCATGGCCGACCCGACCATGGTCACCGACGACCTGATCCGCACC 
CGTGCGGAGCTTACCGAGTACCGGCTGGGCTGGTACCAGTGGCTGCTGGACTAGGCGTGG 
ARLEWLMADPTMVTDDLIRT 

5290 5310 5330 

» • • • • 

CGCCAGGCCATCTTCCAGCAGCCGGATTGGCTCAAGGCCTGCGAGATGAACATGGCACTG

GCGGTCCGGTAGAAGGTCGTCGGCCTAACCGAGTTCCGGACGCTCTACTTGTACCGTGAC 
RQAI FQQPDWLKACEMNMAL 

5350 5370 5390 

CAGGACCTCGAAACCCGCAAGCGGAACATGATCACCGACGCCACTCTCAACGGCATCACG 
GTCCTGGAGCTTTGGGCGTTCGCCTTGTACTAGTGGCTGCGGTGAGAGTTGCCGTAGTGC 
QDLET RKRNMITDATLNG IT 

5410 5430 5450 

GTGCCCGCGATGGTGCTGTGGACCACCAAGGACCCCTCCGGTCCGGTCGACGAAGCCAAG 
CACGGGCGCTACCACGACACCTGGTGGTTCCTGGGGAGGCCAGGCCAGCTGCTTCGGTTC 
VPAMVLWTTKDPSGPVDEAK 

5470 5490 5510 

CGCATCGCCTCCCACATCCCGGGCGCCAAGCTGGCCATCATGGAGAACTGTGGCCACTGG 
GCGTAGCGGAGGGTGTAGGGCCCGCGGTTCGACCGGTAGTACCTCTTGACACCGGTGACC 
RIASH I PGAKLAI MENCGHW 

5530 5550 5570 

CCCCAGTACGAGGACCCCGAGACCTTCAACAAGCTGCATCTGGACTTCCTCCTCGGTCGC
GGGGTCATGCTCCTGGGGCTCTGGAAGTTGTTCGACGTAGACCTGAAGGAGGAGCCAGCG 
PQYEDPETFNKLHLDFLLGR 

5590 5610 5630 

AGCTGACACAGACCCCGGCCGGTGCCGCCAACCCCTGCAACCCGGGCGGCACCGGCCGGA 
TCGACTGTGTCTGGGGCCGGCCACGGCGGTTGGGGACGTTGGGCCCGCCGTGGCCGGCCT 

S * 

5650 5670 5690 

TCTCACTTACCCGACCTATTGCGCTCTCGTCCGGACCCCCGGAGAGAAAGCGCCGAAGCA 
AGAGTGAATGGGCTGGATAACGCGAGAGCAGGCCTGGGGGCCTCTCTTTCGCGGCTTCGT

5710 5730 5750 

GCAGCAAGGAGACCGCCGCGATGCCTGTAGCGCTGTGCGCGATGTCGCACTCCCCCCTGA 



CGTCGTTCCTCTGGCGGCGCTACGGACATCGCGACACGCGCTACAGCGTGAGGGGGGACT 

MPVALCAMSHSPLM 

5770 5790 5810 

,.•••* 
TGGGACGCAACGACCCCGAACAGGAAGTCATCGACGCCGTCGACGCCGCATTCGACCACG 
ACCCTGCGTTGCTGGGGCTTGTCCTTCAGTAGCTGCGGCAGCTGCGGCGTAAGCTGGTGC

GRNDPEQEVI DA VDAAFDHA

5830 5850 5870 

CGCGCCGGTTCGTCGCCGACTTCGCCCCCGATCTCATCGTCATCTTCGCCCCCGACCACT 
GCGCGGCCAAGCAGCGGCTGAAGCGGGGGCTAGAGTAGCAGTAGAAGCGGGGGCTGGTGA 
RRFVADFAPDLI VI FAPDHY 

5890 5910 5930 

...*•* 
ACAACGGCGTCTTCTACGACCTGCTGCCGCCGTTCTGTATCGGTGCCGCCGCGCAGTCCG 
TGTTGCCGCAGAAGATGCTGGACGACGGCGGCAAGACATAGCCACGGCGGCGCGTCAGGC 
NGVFYDLLPPFC IGAAAQSV 

5950 5970 5990 

*•♦*•• 
TCGGCGACTACGGCACCGAAGCCGGCCCTCTCGACGTCGACCGTGACGCCGCCTACGCAG 
AGCCGCTGATGCCGTGGCTTCGGCCGGGAGAGCTGCAGCTGGCACTGCGGCGGATGCGTC 
GDYGTEAGP LDVDRDAAYAV 

6010 6030 6050 

• 4 - * • * 

TCGCCCGCGACGTCCTCGACAGCGGCATCGACGTCGCATTCTCCGAACGCATGCACGTCG 
AGCGGGCGCTGCAGGAGCTGTCGCCGTAGCTGCAGCGTAAGAGGCTTGCGTACGTGCAGC 
ARDVLDSGIDVAFS ERMHVD 

6070 6090 6110 

• ■ • • • • 
ACCACGGATTCGCCCAAGCACTCCAATTGCTGGTCGGATCGATCACCGCCGTGCCGACCG 
TGGTGCCTAAGCGGGTTCGTGAGGTTAACGACCAGCCTAGCTAGTGGCGGCACGGCTGGC 

HGFAQALQLLVG S ITAVPTV 

6130 6150 6170 

* * * 
TGCCGATCTTCATCAATTCGGTCGCCGAACCGCTCGGCCCGGTCAGCCGGGTACGGCTGC

ACGGCTAGAAGTAGTTAAGCGAGCGGCTTGGCGAGCCGGGCCAGTCGGCCCATGCCGACG 
PIFINSVAEPLGPVSRVRLL 

6190 6210 6230 

• • • 

TCGGCGAGGCGGTCGGGCGGGCCGCTGCCAAGCTGGACAAGCGTGTGCTGTTCGTCGGAT 
AGCCGCTCCGCCAGCCCGCCCGGCGACGGTTCGACCTGTTCGCACACGACAAGCAGCCTA 
GEAVGRAAAKLDKRVLFVGS 

6250 6270 6290 

CCGGCGGCCTGTCCCACGACCCGCCGGTCCCGCAGTTCGCCACCGCGCCAGAGGAAGTGC
GGCCGCCGGACAGGGTGCTGGGCGGCCAGGGCGTCAAGCGGTGGCGCGGTCTCCTTCACG
GGLSHDPPVPQFATAPEEVR 

6310 6330 6350 

GCGAGCGGTTGATCGACGGCCGCAATCCCAGTGCCGCCGAACGTGATGCCCGCGAACAGC
CGCTCGCCAACTAGCTGCCGGCGTTAGGGTCACGGCGGCTTGCACTACGGGCGCTTGTCG 
ERLI DGRNPSAAERDAREQR 
6370 6390 6410



GCGTCATCACCGCCGGGCGGGACTTCGCCGCCGGCACCGCCGCCATCCAGCCACTGAACC 
CGCAGTAGTGGCGGCCCGCCCTGAAGCGGCGGCCGTGGCGGCGGTAGGTCGGTGACTTGG 
VITAGRDFAAGTAAI QPLNP 

6430 6450 6470 

CCGAATGGGACCGGCACCTGCTCGACGTCCTCGCCTCCGGCGACCTCGAGCAGATCGACG 
GGCTTACCCTGGCCGTGGACGAGCTGCAGGAGCGGAGGCCGCTGGAGCTCGTCTAGCTGC 
EWDRHLLDVLASGDLEQIDA

6490 6510 6530 

CGTGGACCAACGACTGGTTCGTCGAACAGGCCGGACACTCCTCCCACGAAGTGCGCACCT 
GCACCTCGTTGCTGACCAAGCAGCTTGTCCGGCCTGTGAGGAGGGTGCTTCACGCGTGGA 
WTNDWFVEQAGHSSHEVRTW 

6550 6570 6590 

GGATCGCCGCGTACGCGGCAATGAGCGCCGCCGGGAAGTACCGCGTCACCTCGACCTTCT 
CCTAGCGGCGCATGCGCCGTTACTCGCGGCGGCCCTTCATGGCGCAGTGGAGCTGGAAGA
I AAYAAMSAAGKYRVTST FY 

6610 6630 6650 

ACCGCGAAATCCACGAGTGGATAGCAGGATTCGGGATTACTACCGCCGTCGCCGTCGACG 
TGGCGCTTTAGGTGCTCACCTATCGTCCTAAGCCCTAATGATGGCGGCAGCGGCAGCTGC
REIHEWIAGFGITTAVAVDE 

6670 6690 6710 

AATAGACCCCGCCGCTCCCGCCCCGCAGTCCCAACGAAGGGTGGCCCCGGATGACCTCCG 
TTATCTGGGGCGGCGAGGGCGGGGCGTCAGGGTTGCTTCCCACCGGGGCCTACTGGAGGC 
* M T S V 

6730 6750 6770 

TCCGCCCGTGCTCGCCGTCGGTGAACGCGGGCTGGTCGGTGGGCAGGAAGACCTCATCGC
AGGCGGGCACGAGCGGCAGCCACTTGCGCCCGACCAGCCACCCGTCCTTCTGGAGTAGCG
RPCSPSVNAGWSVGRKTSSP 

6790 6810 6830 

• • . * • • 

CGACATCGCCCTCGACCTCGCAGCTCGTCAGTAGGAATGCGCACGGGCCGACGAGTCGCG 
GCTGTAGCGGGAGCTGGAGCGTCGAGCAGTCATCCTTACGCGTGCCCGGCTGCTCAGCGC 
TSPSTSQLVSRNAHGPTSRA 

6850 6870 6890 

CTGGTCACCGGGGCCAGCCGCGGCATCGGGGCGGCCATCGCAGATGCGGTGGCCGCCTCC 
GACCAGTGGCCCCGGTCGGCGCCGTAGCCCCGCCGGTAGCGTCTACGCCACCGGCGGAGG 
GHRGQPRHRGGHRRCGGRLR 

6910 6930 6950 

GGTGCCGCCGTAATCGTCCACTACGGATCCGATCGGACGGCCGCCGCTGCGGTGTCGACG 
CCACGGCGGCATTAGCAGGTGATGCCTAGGCTAGCCTGCCGGCGGCGACGCCACAGCTGC 
CRRNRPLRI RSDGRR CGVDG

6970 6990 7010 

GCATCACGGCTGCCGGGGGCCTCGCGGCTGCGGTCCAGGCCGACCTGTCCCGACCCGAGG




CGTAGTGCCGACGGCCCCCGGAGCGCCGACGCCAGGTCCGGCTGGACAGGGCTGGGCTCC 
ITAAGGLAAAVQADLSRPEG 
7030 7050 7070



GGCCTGAAGAGCTGATGCGGGAGTTCGACTCCGCGCTCGACGGTCTCGGGCTCGACCGAG 
CCGGACTTCTCGACTACGCCCTCAAGCTGAGGCGCGAGCTGCCAGAGCCCGAGCTGGCTC 
PEELMREFDSALDGLGLDRG 



7090 7110 7130 

GGCTCGACATCCTCGTCAACAACGCCGGAATCAGTCGGCGCGGAGCGCTCGAGCGCGTCA

CCGAGCTGTAGGAGCAGTTGTTGCGGCCTTAGTCAGCCGCGCCTCGCGAGCTCGCGCAGT 
LDILVNNAGI SRRGALERVT 



7150 7170 7190 

• • * • • • 

CTGTCGAGGATTTCGACCGTCTGGTCGCACTCAACCAGCGCGCCCCGTTCTTCGTGACTC
GACAGCTCCTAAAGCTGGCAGACCAGCGTGAGTTGGTCGCGCGGGGCAAGAAGCACTGAG 
VEDFDRLVALNQRA PFFVT R 



7210 7230 7250 

• *•••• 
GGCATGCCCTGCCCCGGATGCACGACGGCGGTCGCATCGTCAACATTTCCTCCGGATCCG 
CCGTACGGGACGGGGCCTACGTGCTGCCGCCAGCGTAGCAGTTGTAAAGGAGGCCTAGGC 
HALPRMHDGGRIVNISSGSA 



7270 7290 7310 

CCCGCTACGCCAGACCCGACGTCATCAGCTACGCCATGACCAAGGGGGCGATCGAGGTGC 
GGGCGATGCGGTCTGGGCTGCAGTAGTCGATGCGGTACTGGTTCCCCCGCTAGCTCCACG 
RYARPDVISYAMTKGAIEVL 



7330 7350 7370 

TCACCCGCGCCCTCGCCGTAGACGTCGGCGAACGAGGCATCACCGCCAACGCCGTGGCGC 
AGTGGGCGCGGGAGCGGCATCTGCAGCCGCTTGCTCCGTAGTGGCGGTTGCGGCACCGCG 
TRALAVDVGERGITANAVAP 



7390 7410 7430 

• * • » • • 

CGGCCGCGCTCGATACCGACATGAACGCGCACTGGCTTCGCGGTGACGACCATGCCCGCA 
GCCGGCGCGAGCTATGGCTGTACTTGCGCGTGACCGAAGCGCCACTGCTGGTACGGGCGT 
AALDTDMNAHW LRGDDHART 

7450 7470 7490 

CCACCGCCGCGTCCACCACTGCACTGCGAAAACTCGCCACCGCGGAGGACATCGCCGCGA 
GGTGGCGGCGCAGGTGGTGACGTGACGCTTTTGAGCGGTGGCGCCTCCTGTAGCGGCGCT 
TAASTTALRKLATAEDIAAI 

7510 7530 7550 

TCGTGGCCTTCCTCGTCAGCGCCGCCGCCGGTGCGATCACCGGGCAGGTCATCGACGCCA 
AGCACCGGAAGGAGCAGTCGCGGCGGCGGCCACGCTAGTGGCCCGTCCAGTAGCTGCGGT 
VA FLVSAAAGA ITGQVIDAT



7570 



CCAACGGCAACCGGCTCTAACCAG 
GGTTGCCGTTGGCCGAGATTGGTC 
N G N R L * 